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OPTIMAL RANK-BASED TESTS FOR HOMOGENEITY 

OF SCATTER 

By Marc Hallin 1 and Davy Paindaveine 2 

Universite Libre de Bruxelles 

We propose a class of locally and asymptotically optimal tests, 
based on multivariate ranks and signs for the homogeneity of scatter 
matrices in m elliptical populations. Contrary to the existing para- 
metric procedures, these tests remain valid without any moment as- 
sumptions, and thus are perfectly robust against heavy-tailed dis- 
tributions {validity robustness). Nevertheless, they reach semipara- 
metric efficiency bounds at correctly specified elliptical densities and 
maintain high powers under all {efficiency robustness). In particular, 
their normal-score version outperforms traditional Gaussian likeli- 
hood ratio tests and their pseudo- Gaussian robustifications under a 
very broad range of non-Gaussian densities including, for instance, 
all multivariate Student and power-exponential distributions. 

1. Introduction. 

1.1. Homogeneity of variances and covariance matrices. The assumption 
of variance homogeneity is central to the theory and practice of univariate 
m-sample inference, playing a major role in such models as m-sample loca- 
tion (ANOVA) or m-sample regression (ANOCOVA). The problem of test- 
ing the null hypothesis of variance homogeneity, therefore, is of fundamental 
importance, and for more than half a century, has been a subject of contin- 
ued interest in the statistical literature. The standard procedure, described 
in most textbooks, is Bartlett's modified (Gaussian) likelihood ratio test 
(see [2]). This test, however, is well known to be highly nonrobust against 
violations of Gaussian assumptions, a fact that gave rise to a large number 
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of "robustified" versions of the likelihood ratio procedure ([3, 5, 6, 21], to 
quote only a few). Soon, it was noticed that these "robustifications," if rea- 
sonably resistant to nonnormality, unfortunately were lacking power: in the 
convenient terminology of Heritier and Ronchetti [22], they enjoy validity 
robustness but not efficiency robustness. 

In an extensive simulation study, Conover, Johnson and Johnson [7] have 
investigated the validity robustness (against nonnormal densities) and effi- 
ciency robustness properties of 56 distinct tests, including several (signed-) 
rank-based ones. Their conclusion is that only three of them survive the ex- 
amination, and that two of the three survivors are normal-score signed-rank 
tests (adapted from [10]). 

In view of its applications in MAN OVA, MANOCOVA, discriminant anal- 
ysis, and so forth, the multivariate problem of testing for homogeneity of 
covariance matrices is certainly no less important than its univariate coun- 
terpart. The same problem moreover is of intrinsic interest in such fields as 
psychometrics or genetics where, for instance, the homogeneity of genetic 
covariance structure among species is a classical subject of investigation. 
Robustness and power issues, however, are even more delicate and complex 
in the multivariate context. 

Here again, the traditional procedure is a Gaussian modified likelihood ra- 
tio test, <^mlrT' the unbiasedness of which has been established (for general 
dimension k and number m of populations, under Gaussian assumptions) 
by Permian [31]. Unfortunately, this test shares the poor resistance to non- 
normality of its univariate counterpart, and is invalid even under elliptical 
densities with finite fourth-order moments: see [12, 38] or [41]. The test 
proposed by Nagao [27], as shown in [19], is asymptotically equivalent to 

^MLRT — s t in under finite fourth-order moments, and hence inherits of the 
poor robustness properties of the latter. Quite surprisingly, and except for 

some attempts to bootstrap <^mlrt ([H) 42, 43]), the important problem 
of testing homogeneity of covariance matrices under possibly non-Gaussian 
densities has remained an open problem for more than fifty years. 

In 2001, Schott [35] proposed a Wald test, </>g"h ott , the validity of which still 
requires Gaussian densities, but also two modified versions thereof — denote 
them as 0g"h ottjf and </>schottf — enjoying validity robustness at homokurtic 
and heterokurtic elliptical densities with finite fourth-order moments, re- 
spectively. More recently, a detailed account of the pseudo-Gaussian ap- 
proach of the problem has been given in [19], where we derive the locally 

asymptotically most stringent (in the Le Cam sense) Gaussian test (jyff , and 

(n) (n) 

show how to turn it into pseudo-Gaussian tests cj)^ and that are valid 
under homokurtic and heterokurtic m-tuples of elliptical densities with fi- 
nite fourth-order moments, respectively. We also show that </4diott* an d 4>j^l 
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(resp. ^g™hottt an d 4>m\) asymptotically coincide. Schott himself recommends 
using the homokurtic version ^"hott* °f ms test and, since heterokurtic sam- 
ples are not considered here, we will concentrate on 0g"h o tt* anc ^ 4$* as the 
only pseudo-Gaussian tests available so far for the problem under study 
(one of the findings of [19] indeed is that the bootstrapped MLRT, although 
perfectly valid, is highly unsatisfactory in the non-Gaussian case). 

This pseudo-Gaussian qualification, however, still requires finite moments 
of order four, and only addresses validity robustness issues. Although no 
multivariate equivalent of the Conover, Johnson and Johnson study [7] has 
been conducted so far, the results we are obtaining in Sections 5.2 and 6 

below, however, indicate that </>gchott* anc ^ ^JV* s * m su ff er a severe lack of 
efficiency robustness, particularly so at radial densities with high kurtosis 
such as the Student with 4 + 6 degrees of freedom (with <5 « 0). 

It is thus very likely that the conclusions of Conover et al. [7] still ap- 
ply, which strongly suggests considering a "rank-based" approach — with a 
concept of "ranks" adapted to the multivariate context. The purpose of this 
paper is to develop such an approach under elliptical assumptions, based on 
the signs and ranks considered in [15] for location, [16] for (auto)regression, 
[14] and [17] for shape. 

Contrary to all existing methods, our tests do not require any moment 
assumptions, so that the null hypothesis they address actually is that of 
homogeneity of scatter matrices, reducing to more classical homogeneity of 
covariance matrices under finite second-order moments. They are asymptot- 
ically distribution- free, reach semiparametric efficiency at correctly specified 
densities, and are both validity- and efficiency-robust. When based on Gaus- 
sian scores, their asymptotic relative efficiency with respect to the Gaussian 
and pseudo-Gaussian tests is larger than one under almost all elliptical den- 
sities (see Section 6 for details). 

1.2. Testing equality of scatter (covariance) matrices. Denote by (Xji, • • • , 
Xj ni ), i = 1, . . . , m, a collection of m mutually independent samples of i.i.d. 
/c-dimensional random vectors with elliptically symmetric densities. More 
precisely, for all i = 1, . . . , m, the observations Xy , j = 1, . . . , rii, are as- 
sumed to have a probability density function of the form 

(i.i) £(x) := CfcJl |:£r 1/2 /i(((x - 0.)X rl (x - e t )) 1/2 ), x e R k , 

for some /c-dimensional vector 6i (location), symmetric and positive definite 
(k x k) matrix Sj (the scatter matrix), and (duly standardized: see below) 
function /i:Mq" — > IR + (the radial density). The null hypothesis considered 
throughout is the hypothesis 7~Lq : Si = • • • = S m of scatter homogeneity (re- 
ducing, under finite variances, to covariance homogeneity). 
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Define (throughout, XI 1 / 2 stands for the symmetric root of E) 



(1.2) 



UgfoS,)- Si '[ (Xii Gi) and 
dij{Oi,Hi) := 1 ^ 2 (Xjj — 6i)\\. 



Denoting by n = Y^Li n i the total sample size and writing Rij{6\, . . . , 6 m , Ei, 
...,E m ) for the rank of Ej) among dn(0i, Si), . . . , d mn , m (0 mi E m ), 

consider the signed-rank scatter matrices 

s K , : = 1 f>( ^'---^; s '---' E) ) u„(d„ shj;^, s), 

Tij \ 71+1 / 

(1.3) 

i = 1, . . . , m, 

where G\,..., 9 m are consistent estimates of the various location parameters, 
E is a consistent (under TCq) estimate of the common null value of the Ej's, 
and K is some appropriate score function. The proposed signed-rank tests 
reject the null hypothesis of scatter homogeneity for large values of 

(1-4) 9 { K := l £ ("i + niOO&Ai'. 

Ki<i' <m 

where := £p^{a k}K tr[(S^ ;i - S^ ;i /) 2 ] +(3 k ,K tr 2 [S^ ;i - S^]}; 

and /9fc 5 K are constants depending on the dimension k and the score func- 
tion K. The form of (1.4) follows from Le Cam type optimality arguments, 

(n) 

but Q K also can be obtained by replacing traditional sample covariance 

matrices with the signed-rank scatter matrices (1.3) in the statistic of the 

pseudo-Gaussian test (jyjj^ derived in [19]. 

The use of signed ranks is justified by the invariance principle: TCq in- 
deed is not only invariant under affine transformations, but also under the 
group of (continuous monotone) radial transformations; see Section 3.2 for 
details. Beyond affine-invariance (all tests considered in this paper are affine- 
invariant), our rank tests — unlike their competitors — are also (asymptoti- 
cally) invariant with respect to the groups of radial transformations; their 
validity robustness actually follows from this latter invariance property. 

As announced, our methodology combines validity and efficiency robust- 
ness. We will show that for (essentially) any radial density /i, it is possible 
to define a score function K := characterizing a signed-rank test which is 
locally and asymptotically optimal (locally and asymptotically most stringent, 
in the Le Cam sense) under radial density f\. In particular, when based on 
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Gaussian scores, our rank tests achieve the same asymptotic performances 
as the optimal Gaussian tests (pff at the multinomial, while enjoying the 

validity robustness of the pseudo-Gaussian </>^ (or ^gchott*) anc ^ even more, 
since no moment assumption is required. Moreover, the asymptotic relative 
efficiencies (AREs) of these normal-score tests are almost always larger than 
one with respect to their parametric competitors; see Section 6. The class of 
tests we are proposing thus in most cases dominates the existing parametric 
ones, both in terms of robustness and power. 

1.3. Outline of the paper. The paper is organized as follows. In Section 2, 
we collect the main assumptions needed in the sequel. Section 3 discusses 
semiparametric modeling issues and their relation to group invariance. Sec- 
tion 4.1 states the uniform local asymptotic normality result (ULAN) on 
which our construction of locally and asymptotically optimal tests is based. 
In Section 4.2, we construct rank-based versions of the central sequences 
appearing in this ULAN result. In Section 5.1, we derive and study the pro- 
posed nonparametric (signed-rank) tests [based on (1.4)] and, in Section 5.2, 
for the purpose of performance comparisons, the optimal pseudo-Gaussian 
ones. Asymptotic relative efficiencies with respect to those pseudo-Gaussian 
tests are derived in Section 6. Section 7 provides some simulation results con- 
firming the theoretical ones. Finally, the Appendix collects proofs of asymp- 
totic linearity and other technical results. 

2. Main assumptions. For the sake of convenience, we are collecting here 
the main assumptions to be used in the sequel. 

2.1. Elliptical symmetry. As mentioned before, we throughout assume 
that all populations are elliptically symmetric. More precisely, defining the 
collections T of radial densities and T\ of standardized radial densities as 

F := {/ > a.e. : fi k -l;f < °°} 

and 

Fi ■= (Zi G F: t r k ~ l h(r) dr = \\, 

respectively, where /j,£.j := J^°r e f(r)dr, we require the following. 

Assumption (A). The observations Xj,-, j = 1, . . i = 1, . . . ,m, are 
mutually independent, with p.d.f. fi, i = l,...,m, given in (1.1), for some 
/1G.F1. 
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Clearly, for the scatter matrices Sj in (1.1) to be well defined, identifiabil- 
ity restrictions are needed. This is why we impose that f\ G^Fi, which implies 
that dij(6i,'Si) defined in (1.2) has median one under (1.1), and identifies 
Sj without requiring any moment assumptions (see [17] for a discussion). 
Note, however, that under finite second-order moments, £j is proportional 
to the covariance £oi of Xy , with a proportionality constant that does not 
depend on i: the hypotheses of scatter and covariance homogeneity then 
coincide. 

Special instances of elliptical densities are the fe-variate multinomial dis- 
tributions, with standardized radial density fi(r) =0i(r) := exp(— afcr 2 /2), 
the fc-variate Student distributions, with radial densities (for v € Rq~ de- 
grees of freedom) f\{r) = f\ v (r) := (1 + a^^r 2 /v)~( k+u ^ 2 , and the fc-variate 
power-exponential distributions, with radial densities of the form fi(r) = 
fi,r]( r ) := ex P( — bk^r 271 ), i] £ Ro '■> ^ ne positive constants a^, and bk tV are 
such that /i € T\ . 

The equidensity contours associated with (1.1) are hyper-ellipsoids cen- 
tered at Oi , the shape and orientation of which are determined by the scat- 
ter matrix Sj. The multivariate signs Uj,(0i,£j) and standardized radial 
distances dij{9i,Hi) defined in (1.2) are the (within-group) elliptical coordi- 
nates associated with those ellipsoids: the observation Xj,- then decomposes 

1/2 

into 6i + dijUi Ujj, where the Uy's, j = 1, . . . ,th, i = 1, . . . ,m are i.i.d. 
uniform over the unit sphere in M. k , and the cZy's are i.i.d., independent 
of the Ujj's, with common density /ifc(r) := (//fc_i ; / 1 )~ 1 r fc_1 /i(r) over IR + 
(justifying the terminology standardized radial density for /i) and distribu- 
tion function F^. In the sequel, the notation gif. and G\k will be used for 
the corresponding functions computed from a standardized radial density 



The derivation of locally and asymptotically optimal tests at radial den- 
sity fi will be based on the uniform local and asymptotic normality (ULAN) 
of the model at given fx. This ULAN property — the statement of which re- 
quires some further preparation and is delayed to Section 4.1 — only holds 
under some further mild regularity conditions on f\. More precisely, ULAN 
(see Proposition 4.1 below) requires f\ to belong to the collection T a of 
absolutely continuous densities in T\ such that, letting ipf ± := — (with 
fi the a.e. -derivative of /i), the integrals 



are finite. The quantities Ik(fi) and Jk{fi) play the roles of radial Fisher 
information for location and radial Fisher information for shape/scale, re- 
spectively (see [17]). 



5i(e^i). 
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2.2. Asymptotic behavior of sample sizes. Although, for the sake of no- 
tational simplicity, we do not mention it explicitly, we actually consider 
sequences of statistical experiments, with triangular arrays of observations 
of the form (XjJ , . . . , X n) (ri) , , . . . , X^ n) (n) , . . . , X^J , . . . , X (n) (n) ) indexed 

by the total 
assumption. 



in) 

by the total sample size n, where the sequences n\ satisfy the following 



Assumption (B). For alH = 1, .. . ,m, rii = nf 1 ' —> oo as n —> oo. 

Note that this assumption is weaker than the corresponding classical as- 
sumption in (univariate or multivariate) multisample problems, which re- 
quires that rii/n be bounded away from and 1 for all i as n — > oo. Letting 

A W : =n 4 (n) /n, it is easy to check that Assumption (B) is actually equivalent 
to the Noether conditions 

maxl t4- — , - — 7 — r ) = o(n) as n — > oo, for all i, 

V Aj n) l-\\ n) J 

that are needed for the representation result in Lemma 4.1(i) below. How- 
ever, in the derivation of asymptotic distributions under local alternatives, 
the following reinforcement of Assumption (B) is assumed to hold (mainly, 
for notational comfort): 

Assumption (B'). For alH = 1, . . . ,m, X^ — > Aj G (0, 1), as n — > oo. 

2.3. Score functions. The score functions K appearing in the rank-based 
statistics (1.3)-(1.4) will be assumed to satisfy the following regularity con- 
ditions. 



Assumption (C). The score function K : (0, 1) — ► R (CI) is a contin- 
uous, nonconstant, and square-integrable mapping which (C2) can be ex- 
pressed as the difference of two monotone increasing functions, and (C3) 
satisfies f K(u) du = k. 

Assumption (C3) is a normalization constraint that is automatically satis- 
fied by the score functions K{u) = Kf^u) := (ff x {F^ (u))F^} (u) leading to 
local and asymptotic optimality at radial density f\ (at which ULAN holds); 
see Section 4.1. For score functions K,K±,K2 satisfying Assumption (C), 
let J k (K u K 2 ) :=V[K 1 {U)K 2 {U)} and C k (K u K 2 ) :=Cav[Ki(U),K 2 (U)] = 
j7jfc(.Ki, K 2 ) — k 2 [throughout, U stands for a random variable uniformly dis- 
tributed over (0, 1)]. For simplicity, we write Jk{K) for J k {K,K), £k{K) for 
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C k (K,K), JkiKJt) for E[K(U)K fl (U)], C k {h,9i) for E[K fl (U)K gi (U)] - 
k 2 , etc. 

The power score functions K a (u) := k(a + l)u a (a > 0) provide some tra- 
ditional score functions satisfying Assumption (C), with J k (Ka) = k 2 (a + 
l) 2 /(2a + 1) and C k (K a ) = k 2 a 2 /(2a + 1): Wilcoxon and Spearman scores 
are obtained for a = 1 and a = 2, respectively. As for score functions of 
the form K^, an important particular case is that of van der Waerden 
or normal scores, obtained for f\ = 4>\. Then, denoting by ^f k the chi- 
square distribution function with k degrees of freedom, Kfa(v) = ^~^ l {u), 
<Jk(<t>i) = k(k + 2), and £k(4>i) = 2k. Similarly, writing G k , v for the Fisher- 
Snedecor distribution function with k and v degrees of freedom, Student den- 
sities h = flu Yield K f tju) = k(k + u)G^{u)/{u + kG^(u)), J k (f{ v ) = 
k{k + 2){k + v) / {k + u + 2) and C k {f\ v ) = 2ku/{k + v + 2). 

3. Semiparametric modeling of the family of elliptical densities. 

3.1. Scatter, scale, and shape. Consider an observed n-tuple Xi, . . . ,X n 
of i.i.d. /c-dimensional elliptical random vectors, with location 6, scatter S, 
and radial density f\ G T\ but otherwise unspecified. The family of dis- 
tributions for this observation is indexed by (9, S, f\). As soon as a semipara- 
metric point of view is adopted, or when rank-based methods are considered, 
the scatter matrix 5] naturally decomposes into 5] = er 2 V, where a is a scale 
parameter (equivariant under multiplication by a positive constant) and V 
a shape matrix (invariant under multiplication by a positive constant). A 
semiparametric model with specified a and unspecified standardized radial 
density f\ indeed would be highly artificial, and we, therefore, only consider 
the case under which a and f\ are both unspecified. This semiparametric 
setting is also the one that enjoys the group invariance structure in which 
the ranks and the signs to be used in our method spontaneously arise from 
invariance arguments; see Section 3.2 below. 

The concepts of scale and shape however require a more careful definition. 
Denoting by S k the collection of all k x k symmetric positive definite real 
matrices, consider a function S :Sk — > Mq" satisfying S(XH) = A5(S) for all 
A 6 M.q and 5] G S k , and define scale and shape as as ■= (S^S)) 1 / 2 and 
:= S/S'(S), respectively. Clearly, Vg is the only matrix in S k which 
is proportional to XI and satisfies S(Vs) = 1: denote by Vjf := {V G S k ■ 
S(V) = 1} the set of all possible shape matrices associated with S. Classical 
choices of S are (i) 5(E) = (E)n ([14, 17, 23] and [33]); (ii) 5(E) = A:" 1 tr(E) 
([8, 28] and [39]); (iii) 5(E) = [E] 1 ^ ([9, 34, 36] and [37]). 

In practice, all choices of 5 are essentially equivalent. Bickel (Example 4 
of [4], with a trace-based normalization of E _1 ) actually shows that irre- 
spective of 5, the asymptotic information matrix for Vg in the presence 
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of unspecified 6 and as is the same, at any 6 G R fc , as G Rq~, Vg G 
and /i, whether /i is specified (parametric model) or not (semiparametric 
model): once 6 and 0-5 are unspecified, an unspecified f± does not induce 
any additional loss for inference about V5. In [30], Paindaveine establishes 
the stronger result that the information matrix for V5 in the presence of 
unspecified 0, as and f\ is strictly less, at any 6 G R fc , as G Rq", V5 G 
and fx, than in the corresponding parametric model with specified 0, as 
and fi — except for S:Y1>-^ [Sl 1 ^, where those two information matrices 
coincide: under this determinant-based normalization, thus, the presence of 
nuisances (0, as and /1) (resp., 9, V5, and /1) asymptotically has no effect 
on inference about shape (resp., inference about scale). In both cases, it can 
be said (adopting a point estimation terminology) that shape can be esti- 
mated adaptively. This Paindaveine adaptivity, however, where 0, as and /1 
lie in the nuisance space of the semiparametric model, is much stronger than 
Bickel adaptivity where only f± does. This finding strongly pleads in favor 
of the determinant-based definition of shape which, with its block-diagonal 
information matrix for 6, as and V5, is also more convenient from the point 
of view of statistical inference, and as we will see in Section 5.1, allows for 
an ANOVA-type decomposition of the test statistics into two mutually in- 
dependent parts providing tests against subalternatives of scale and shape 
heterogeneity, respectively. Therefore, throughout we adopt 5(S) = 
and henceforth simply write V G Vk and a for the resulting shape and scale. 
The parameter in our problem then is the L-dimensional vector 

:= (^,^,4)' := (0[, . . . , 6' m , al...,a 2 m , (vechVi)', . . . , (vech V m )')', 

o 

where L = mk(k + 3)/2 and vech(V) is characterized by vech(V) =: ((V)n, 
(vechV)')'. Indeed, 5^ is entirely determined by af and vech(Vj). Write 
for the set W nh x (Rj" ) m x (ve°ch(Vfc)) m of admissible # values, and or 

(n) 

^19/1?// i?//r/i ^ or f ne j°i n t distribution of the n observations under parameter 
value $ and standardized radial density f\ (always implicitly assumed to 
belong to when notation f\ is used). 

Finally, note that (i) Uy(0i, Ej) = Vij(6i, Vj) and (ii) dy(0 i ,S i ) = 
a^ 1 dij{6i,^Vi). It follows from (ii) that under 7io, the ranks of the radial 
distances computed from the common value V of the shape matrices co- 
incide with those of the standardized radial distances computed from the 
common value S of the scatter matrices. 

3.2. Invariance issues. Denoting by A^(T) the vector space spanned by 
the columns of the L x r full-rank matrix T (r < L), the null hypothesis 
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of scatter homogeneity Ho : <r 2 Vi = ■ ■ ■ = o"^V m can be written as TCq : ■d £ 
M(T), with 













T := | 












xO 





Tin 




(3.1) 

fc(fc + l) 
*»,:=— 2—- 1, 

where l m := (1, . . . , 1)' G M m and denotes the ^-dimensional identity ma- 
trix. 

Two distinct invariance structures play a role here. The first one is related 
with the group of affine transformations of the observations, which generates 
the parametric families P^l := [j-ffeMCT)^ 3 ^^} • More precisely, this group 
is the group Q m,k ,o of affine transformations of the form Xjj i— > AXjj + b^, 
where A is a full-rank (k x fe) matrix and B := (bi,...,b m ) a (k X m) 
matrix. Associated with that group is the group Q m > k ,o of transformations 
i9 i — ► g^g(iJ) of the parameter space, where 

g£* (0) := ((Afl! + bO', . . . , (A0 m + b m )', lAp/Vf, . . . , |A| 2/fc <r^, 

(ve°ch(AViA'))7l A| 2/fc , . . . , (ve°ch(AV m A))7l A| 2/fe )'. 
Clearly, 7{q is invariant under Q m < k ; o — meaning that g^g(.M(Y)) = A4(T) 

for all g^'g. Therefore, it is reasonable to restrict to affme-invariant tests of 
7^0- Beyond their distribution- freeness with respect to the O^s and the com- 
mon null values a and V of the scale and shape parameters, affme-invariant 
test statistics — that is, statistics Q such that Q(AXn +bi, . . . , AX m „ m + 
b m ) = Q(Xn, . . . , X mnm ) for all A, bi, . . . , b m — yield tests that are coordinate- 
free. 

A second invariance structure is induced by the groups Q,o 
continuous monotone radial transformations, of the form 

X ^ £ 9 ( X 11> • ■ • i X mn m ) 

= + V)V 1 /2 Ull (0 1 , V), . . . , 

m + d mnm (^m, V)V 1 / 2 U mnm (0 m , V)) 

:= (0i + <?(dn(0i, V))vV 2 Uii (0i, V), . . . , 

m + 5(dmn m (0 m ,V))V 1 / 2 U„ mm (0 m ,V)), 

where g : R + — ► M + is continuous, monotone increasing, and such that g(<S) = 
and linij^oo g{r) = oo. For each V, this group ^ l3/ ' V , o is a generating 



RANK TESTS FOR SCATTER HOMOGENEITY 11 
group for Va^v :=Uo-U/i{P } ( a nonparametric fam- 

t? Jl( T2l m ,l m (g)(vechV);/i 

ily). In such families, the invariance principle suggests basing inference on 
statistics that are measurable with respect to the corresponding maximal in- 
variant, namely the vectors (Un , . . . , U mrim ) of signs and the vector (Rn , . . . , 

Rmn m ) of ranks, where Ujj = Uy-(0i,V), and Rij = Rij{9u ■ ■ ■ , Om, V, . . . , V). 

(ri) 

Such invariant statistics of course are distribution- free under 

4. Uniform local asymptotic normality, signs and ranks. 

4.1. Uniform local asymptotic normality ( ULAN). As mentioned in Sec- 
tion 1, we plan to develop tests that are optimal at correctly specified den- 
sities, in the sense of Le Cam's asymptotic theory of statistical experiments. 
In this section, we state the ULAN result (with respect to location, scale 
and shape parameters for fixed radial density fx) on which optimality will 
be based. Writing 

= . . . , 0t\ o?<»\ • • • , (vechV^)', . . . , (vechvW)')' 

for an arbitrary sequence of L-dimensional parameter values in 0, consider 
sequences of "local alternatives" + n~ l ^ 2 u^T^ n \ where 

_(n) _ (_(")' Jn)i (n)/y 
T —\ T Ii T II ' T III > 

= (tf* . . . , tt\ sf n \ . . . , (vechv^)', . . . , (vech vW)')' 

is such that sup n r^VW < oo and where, denoting by A^ n ) = (A^) the 
(m x m) diagonal matrix with A^ := (A^) -1 / 2 (see Section 2.2), 



!» : = | o i/W 

o „w 

[under Assumption (B'), we also write v for lim n ^ 00 i/( n ']. Clearly, these 
local alternatives do not involve (vj )n, i = 1, . . . , m. It is natural, though, 
to see that the perturbed shapes V"j n ^ + 71$ remain [up to o{n i 1 ^ 2 )'s] 

within the family Vk of shape matrices: this leads to defining (vj n ' l )ii in 
such a way that tr[(vf l) )- 1 v| n) ] =0, i = l,...,m, which entails |vj n) + 
nr 1/2 v f = 1 + o(nr 1/2 ) (see [18], Section 4). 

The following notation will be used throughout. Let diag(Bi, B2, • • • , B m ) 
stand for the block-diagonal matrix with diagonal blocks Bi, B2,...,B m . 
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Write V® 2 for the Kronecker product V (8) V. Denoting by eg. the £th vector 
of the canonical basis of R k , let K fc := J2i,j=i( e i e 'j) ® ( e j e 'i) be the (k 2 x k 2 ) 
commutation matrix, and put := (veclfc)(veclfc)'. Finally, let Mfe(V) be 

the (ko x k 2 ) matrix such that (Mfc(V))'(vechv) = (vecv) for any symmet- 
ric k x k matrix v such that tr(V~ 1 v) = 0. As shown in Lemma 4.2(v) of [30], 
M fc (V)(vec V" 1 ) = for all V G V k . 

We then have the following ULAN result; the proof follows along the same 
lines as in Theorem 2.1 of [30], and hence is omitted. 



Proposition 4.1. 
Then the family Vl^ 



Assume that (A) and (B) hold, and that fx 6 J- a . 
= {P^Jj?G0} is ULAN, with central sequence 



u 





















/ A 77 ' 1 \ 


a/// _ 


f 


.11, m 




I a. 777 '" 1 , 



where [with dij = djj(#j, Vj) and U« = U.y Vj)/ 



1/2 n, 



(7 



A 



>;/i 



3=1 

1/2 m 



di 



-1/2 



u 



di,- \ d 



awl?!; 



III,i 



1/2 



-M fe (Vi)(Vf 



- 1/2 E^^)^vec(U^ 

3 = 1 



'J' 



i = 1, ... ,m, and full-rank block- diagonal information matrix 

(4.2) r* ;/l :=diag(r 7 ;/l ,r 77 /l ,r 777 1 ), 

w/iere, defining a := diag(cii, . . . , a m ), V := diag(Vi, . . . , V m ), M fc (V) := 
diag(M fe (V 1 ),...,M fe (V m )) and V^:=diag(Vf 2 ,...,V® 2 ), we Zet 



1 



■p/i i 

1 •- I 



RANK TESTS FOR SCATTER HOMOGENEITY 



13 



and 

1% := 4 ^ ( y 2) M fc (V)[I m ® (I fe2 + K fe )](V^) -1 (M fc (V))'. 

More precisely, for any any bounded sequence T^ n ' , 

we have 

A (n) — WdP (n) /dP (n) ) 

= (rW)'AW ;/i -^(rW)'r i?;/l rW +0 p(l) 

and A^(n).^ — — >■ jV(0, T^.jJ under P^J^ , asn^oo. 

The classical theory of hypothesis testing in Gaussian shifts (see Section 
11.9 of [26]) provides the general form for locally asymptotically optimal 
(namely, most stringent) tests in ULAN models. Such tests, for a null hy- 
pothesis of the form ■& £ A4(T), should be based on the asymptotically 
chi-square null distribution of 

Q r := A^I^fpE - proj(r^ i ( l ,W)- 1 T)]r i?; 1 / ( 2 A 1?;/l 

[with i? replaced by an appropriate estimator see Assumption (D) below], 
where proj(A) = A[A'A] _1 A / , for any (L x r) matrix A with full rank r, is 
the matrix projecting M L onto M(A). Whenever T^-f 17 u^ n ' and T all hap- 
pen to be block-diagonal, which is the case in our problem, this projection 
matrix clearly is block-diagonal, with diagonal 

blocks proj((r^ ;/i ) 1 /2 ( ^) ) -i X/)5 pro j((r4 / ;/i ) 1 /2 ( ^) ) -i X//)) and 

proj((r4f /l ) 1/2 (^S)~ lT ///)> denoting projections in W nk , M. m and M. mk ° , 
respectively. Since moreover M{{Y I ^.^ 1 ) l / 2 {u^)~ l x Tj) = W nk , we have 
proj((r^.j i ) 1 / 2 (i>'^)~ 1 T/) =I m fci so that Q-f reduces to 

x [i - proj((r^ ;/i ) 1 / 2(i ,w ) -i T//)]( r4{ /i) -V2 A // /i 
+ (A^ fl )'(r^ i )" 1 / 2 

x[I-proj((r^ i )V 2(ly W ) -i T///)](r ^ i) -i/ 2A ^ i , 

where A^.^ does not play any role. Accordingly, in the next section, we 

proceed with rank-based analogues of A^.^ and A^ x only. 

Note that the decomposition (4.3) of Qt into two asymptotically orthog- 
onal quadratic forms corresponds to the decomposition of scatter hetero- 
geneity into scale and shape heterogeneity; each of the two quadratic forms 
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in the right-hand side of (4.3) actually constitutes a locally asymptotically 
optimal test statistic for TCq against one of those two subalternatives. 

4.2. A rank-based central sequence for scale and shape (scatter). A gen- 
eral result by [20] implies that, in adaptive models for which fixed- f\ sub- 
models are ULAN and fixed- 19 submodels are generated by a group in- 
variant versions of central sequences exist under very general assumptions. 
In the present context, this result would imply the existence, for the null 
values of #, of central sequences based on the multivariate signs U« and the 
ranks R^. Although that result does not directly apply here, it is very likely 
that it still holds. This fact is confirmed by the asymptotic representation 
of Lemma 4.1 (i) below. 

Consider the signed-rank statistic [associated with some score function K 
satisfying Assumption (C)] A*.* := ((Ag K )', (A% K )')' := (Aj£, . . . , 

(aS)'--(O' where 

and 

(4-5) Ag£ := ^M fc (V,)(Vf )- 1/2 £ K^j) vec(U^Uy. 

The following lemma provides (i) an asymptotic representation and (ii) the 
asymptotic distribution of A#.^ (see the Appendix for the proof). An im- 
mediate corollary of (i) is that A# ; / a := A^^y , with K = Kf 1} actually 

constitutes a signed-rank version of the scatter part ((A^.^)', (A^.^)')' of 
the central sequence A^ ; ^ . 

Lemma 4.1. Assume that (A), (B) and (C) hold. Fix ■& G M(T) (with 
common values a and V of the scale and shape parameters). Let be 
the rank of dij := dij(6i,V) among dn, . . . ,d mnrn , and let XJij := Ujj(0j,V). 
Then, for all g± G J-\ : 

(i) A#. K = A#. K;gi + o L 2(l), under P#) gi , asn^oo, where A^. K;gi := 
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and 

Tli y / W \ \ 

(4.6) A^ ;gi := ^M fe (V)(V^ 2 )-V 2 ^^^ u 0j j vec^U^.); 

(ii) de/mm 5 H fc (V) := ^^M fc (V)[I fe2 + R^V® 2 )- 1 ^^))', 
Atf-K; gi is asymptotically normal with mean zero and mean 

( 4^rA(#,<?i)T// \ 

\J"fc(^,5l)[Im<8)H fc (V)]T 77/ y 
under P^) gi and P^ re -i/2 I/ (») 1 -. g j respectively, and covariance matrix 

T#. K := diag(r$ K ,r^) := diag^£ fe (^)I m , Jfc(ir)[I m <g> H fc (V)A 

under both (the claim under P^. n -i/2 w (») T . further requires g\ G J^aj. 

As mentioned in the description of the most stringent tests (see the com- 
ments after Proposition 4.1), we will need replacing the parameter $ with 
some estimate. For this purpose, we assume the existence of # := sat- 
isfying: 

Assumption (D). The sequence of estimators (&( n \n G N) is: 

(Dl) constrained: P^[0 (n) G A4(T)] = 1 for all n, G A4(Y), and gi € J-\; 
(D2) n 1 / 2 (i/( n ))- 1 -consisteni: for all G A4(T), n 1 / 2 (i/ (n) )~ 1 (0 (n) - 0) = 

P (1), as n -> oo, under U^e^lP^J; 
(D3) locally asymptotically discrete: for all # G A'f(Y) and all c > 0, there 

exists M = M(c) > such that the number of possible values of ??( n ) 

in balls of the form {t G R L :n 1 / 2 \\(u^)~ 1 (t - < c} is bounded by 

M as n — > oo, and 

(D4) affine-equivariant: denoting by #( n )(A, B) the value of I &^ n ' computed 
from the transformed sample AXjj + bj, j = 1, . . . , rii, % = 1, . . . , m, 

#»>(A,B) = gA&(# n) ), for all G S m ' fc . 

There are many possible choices for However, still in order to avoid 
moment assumptions, we propose the following estimators, related with the 
affine-equivariant median proposed by [23]. For each i = 1, . . . , m, let 0i and 
Vj be characterized by 

-J2U ij (9 i ,V i ) = and —^\J ij {e i ,V i )V^(e i ,V i ) = -rlk, 

J=l J=l 
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with |Vj| = 1. Then, under 1? G Ai(T), the common value V of the Vj's 
is consistently estimated [as n — > oo, under U gi eF 1 {P^ gi } an d Assumptions 
(A) and (B), without any moment assumption on gi], at the rate required by 
Assumption (D2), by the Tyler [39] estimator V computed from the n data 
points Xj,- — 6i and normalized in such a way that |V| =1. Under the same 
conditions, the null value a of the scale is the common median of the i.i.d. 
radial distances V), so that the empirical median a of the dij(6i, V)'s 

can be used as an estimator of a. Consequently, the estimator 

(4.7) := (d[, . . .,6' m ,a 2 l' m , l' m ® (vech V)')' 

satisfies (D2) above — except perhaps for the part which, however, is not 
involved in the test statistics below. This estimator also satisfies (Dl) and 
(D4). As for (D3), it is a purely technical requirement, with little practical 
implications (for fixed sample size, any estimator indeed can be considered 
part of a locally asymptotically discrete sequence). Therefore, we henceforth 
assume that (4.7) satisfies Assumption (D). 

The resulting ranks Rij := Rij{6\, . . . ,6 m , V, . . . , V) are usually called 

aligned ranks. The following asymptotic linearity result describes the asymp- 

(n) 

totic behavior of the aligned version A^.^ of A& } k under P^.^; see the 
Appendix for the proof. 

Proposition 4.2. Assume that Assumptions (A), (B), (C) and (Dl)- 
(D3) hold. Let g\ G T a and G (with common values a and V for 

the scale and shape parameters). Then, 

A 'l,K ~ + ^jO- k (K,g 1 )(u^r 1 n^ ( h ) " *n) 

and 

A% R - A% K + J k (K, gi )[I m H fc (V)](^)-V/ 2 (^g - # m ) 

are op(l) under P^ gi , as n — > oo. 

5. Optimal tests of scatter homogeneity. 



5.1. Optimal rank-based tests. For all # G Ai(T) (with common values a 
and V for the scale and shape parameters), define 

- 4(j4 [t -c^l 
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and 

■ — \ L d;K ) \ v III ) 1 III { 1 III III ) 1 &;K\ l/ IIl) 1 HI ) 1 /// \ v III ) 

= {J k {K))- l [l rn - CW] ® (Hfc(V))- 1 , 
where = (Cffi) denotes the (m x m) matrix with entries cffi := (A$ A^) 1 / 2 - 

(n) 

The .fT-score version <p K of the rank-based tests we are proposing rejects 

TCo :i? G .M(Y) whenever the (affine-invariant) test statistic 
nW — CA? VP? a( 7 + (A 1 - 11 )'Pi n A 1 - 11 



m r r 1 

<ki' 1 



E 

i,i'=l 



re,- n 



j=lj'=l 



C k {K)\ \n + lj J\ \n + l 



+ Hk + i^ K /R ll _\ K (R llJ 



2J k (K) \n + lj \n + l, 

"(W-i)) 

exceeds the a-upper quantile X( m _i)(fc 0+ i)-i_ Q of the chi-square distribution 
with (m — l)(ko + 1) degrees of freedom (Si^i stands for the usual Kronecker 
symbol); the explicit form of (H^V))^ 1 allowing for (5.1) can be found in 

Lemma 5.2 of [18]. In the sequel, we write (j> and for <f> x f an d Qk) > 
respectively. 

The decomposition (5.1) of the rank-based test statistic Q$ into two 

asymptotically orthogonal terms parallels the corresponding decomposition 
(4.3) of Q-f (see the closing remark of Section 4.1), with the same interpre- 
tation in terms of subalternatives of scale and shape heterogeneity, respec- 
tively. 

We are now ready to state the main result of this paper; for the sake 
of simplicity, asymptotic powers are expressed under Assumption (B') and 
perturbations r"' such that ]ha n -- too i'^T^ n ' = ur ^ M(T), with UnTn = 

(sf/VAi, •■ • ,«m/\ / V) / and vniTm = ((vech vi)'/VAl, • ■ • , (vech v m )'/v / A^) / - 
For any such r and any 1? € A4(T) (still with common values a and V of 
the scale and shape parameters), let 

rJ T :=ilim{(rW)'[I ra -CW]rW} 
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(5.2) 



E 



\ \ . / „2 „2 \ 2 

A l A l i ( Sj Sj/ 



\<i<v <m 



and 
(5.3) 



rg. := 2fc(fc + 2) lim {(rg)'[[I m - CW] ® H fe (V)]rg} 

, / V.- V.-7 \ \ ^ 

V 



W' tr 

l<i<i'<m 



note that tr(V _1 Vj) = for all i (see the comments before Proposition 4.1). 

Theorem 5.1. Assume that (A), (B), (C) and (D1)-(D3) hold. Then: 
(i) is asymptotically chi-square with (m— l)(£;o + l) degrees of free- 

dom under U#e.A/f(Y) UgiG^a^!?-)/! J"' anc ^ [provided that (B) is reinforced into 
(B')y asymptotically noncentral chi-square, still with {m — l)(/co + 1) degrees 
of freedom, but with noncentrality parameter 

K ' 4C k {K) * T + 2fc(fc + 2) J k (K) r * T 



underVf +n _ 1/WT[n) . gi , 0eM(T), ur := lirn^ i>M n > £ >t(T), and 

(ii) i/ie sequence of tests (j)^ has asymptotic level a under 

(iii) if fi € JT a and satisfies Assumption (C), i/ie sequence of tests 
In) 

cf) ^ is locally and asymptotically most stringent, still at asymptotic level a, 
/ 0r U^eA<(T)U 5l 6^ o {P^ 1 } against alternatives of the form U^x(T){ p ^/ 1 }- 
See the Appendix for the proof. After some algebra, one obtains 

Q ( k ] = 1 - E (^+^0Q£U 



Tl 

l<i<i' <m 



where 
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(5.5) 



C k (K) 



n 



5=1 



ra+ 1 



+ 



£:(£; + 2) 



n iJ^{ Vn+1. 



is the test statistic obtained in the two-sample case (for populations i and i'); 
see [40] for a similar decomposition in MANOVA problems. As announced, 
no estimate of the common scale appears in the test statistics. Also, 
letting 



the statistics in (5.5) take the simple form 



n (n) 



rii + m' \Ck(K) 

k(k + 2) 
+ 2J k (K) 

riiUi' ( k(k + 2) 



tr 2 [S K ;i ~ Sk-,v] 



tr[(S K -,i - S K .i>) } - -tr [S K] i - S K;i >] 



n i + n il {2J k (K) 



tr[(Stf ; j - SK;i>) } 



k(J k (K)-k(k + 2)) ^ 2fc 

2J k (K)C k (K) tr [^-^ 



For Gaussian scores (i.e., for A" = K^; see Section 2.3), one obtains the 
van der Waerden test statistics 



(5.6) 
where 



l<j<i'<m 



(«) 

vdW-.i.i' ' 



VvdW;i,i' 



2(rii + rij 



tr[(S v dW;i — S v dW;i') 2 ]) 
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with S vd w;i : = YJjLi *fc 1 (-^ii/( n + l))UijU^.. The Student scores (i.e., 
K = K f t ; see Section 2.3 again) yield 



(5.7) Qf =- £ (n l + n l ,)Q^ , iin 

KKi <m 



where 

g(n) »»?V + i/ + 2 

~fi,v'' 1 ' 1 rii + rii' 2{k + v) 

x{tr[(S /t :i -S /t ) 2 ] + itr 2 [S /t .< - Sy* .<,]} 

— ■'1,1/' — ■'1,1/ ' — •'1,1/' — ■'l,!/' I 

with Sy,^ := fc(fc + i/)nr 1 E^ 1 Gfc ) i(^ i /(n + l))/[i/ + fcG^(4-/(n+l))] x 

UijU^-. As for the tests associated with the power score functions K a , they 
are based on 

(5.8) 0g = i £ K + niOQg ;M „ «>0, 

l<j<i'<m 

where, letting S^ a;i := fc(a + l)(n + l)""^ 1 ES = i(%)°U<jU«, 



_ ( a 2 fc _ 4a _ 2) tr 2 [S^ - S Xa y]}. 



Corollary 5.1. Assume that the conditions of Theorem 5.1 hold. Then: 

(i) provided that g\ £ is suc/i i/iai Ck{K,g\) ^ / Jk{K,g\), <f>g is 

consistent under any local g\- alternative [i.e., under any P^. n -i/2 I/ (») T (n). fl > 

0£M(T), lim^oo i/^tW $M(T)]; 

(ii) i/ie same conclusion holds if uh if (it) absolutely continuous with 
a.e. derivative K, and if g\ G ,F a is smc/i #iai f^ K{G\k{r))r{gik{r)) 2 dr > 
(m particular, if K is nondecreasing). 

We refer to the Appendix for the proof. This corollary shows that the van 
der Waerden tests above, as well as those achieving local asymptotic strin- 
gency at prespecified Student or power-exponential densities, are locally con- 
sistent against arbitrary elliptical alternatives, since the corresponding score 
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functions are strictly increasing. Similar conclusions hold for the tests asso- 
ciated with the power score functions K a , a > 0. More general consistency 
results, against nonlocal and possibly nonelliptical alternatives, of course, 
are highly desirable, and can be obtained by exploiting Hajek's results on 
rank statistics under alternatives (see [13]), much along the same lines as 
in Section 5.2 of [17]. Such results, which we do not include here, imply 
that consistency is achieved at "almost all" nonlocal alternatives — the ex- 
ception being those very particular densities achieving a set of orthogonality 
conditions involving the scores and the Ujj's. 

Another important concern is validity under homogeneous scatter but 
possibly heterogeneous or/and nonelliptical densities. The ranks of the dij's 
then lose their distribution- freeness and/or their independence with respect 
to the Ujj-'s. Under totally arbitrary situations, little can be said, but it is 
extremely unlikely that the tests we are developing here remain valid; if ellip- 
ticity and finite moments of order four can be assumed, the pseudo-Gaussian 
methods developed in [19], which remain valid under possibly heterokurtic 
elliptic densities, are preferable. 

5.2. The optimal pseudo-Gaussian tests. As explained in the Introduction, 

(n) (n) 

the pseudo-Gaussian tests (j)^ and 0g c h o tt* are the natural competitors of 
our rank-based procedures. Since they are asymptotically equivalent (see 
[19]), we concentrate on <p^}\- This test is valid under finite fourth-order mo- 
ments E k (gi) := ^(G~l(u)) A du = (/z fe _i ;ffl ) _1 Jo°° r fc+3 #i(r) dr, that is, un- 
der any P^?L such that g\ 6 J-f := {g\ 6 T\ : E k {g\) < oo}. For all such <?i, 
let D k (gi) := Jq (G^ (u)) 2 du: then K k (gi) := p^J ^fe) ~~ 1 is a measure of 
kurtosis (see, e.g., page 54 of [1]), common to the m elliptic populations un- 
der . This kurtosis is consistently (still under , <? x e J^f) estimated 
by 

. k (^jXiS^i^ 1 

Kk - (k + 2) ( n -i YhLi YljLi d%j) 2 ' 

where dy := dij (Xj, Si), with Xj := n^ 1 Y^L\ Xj and Sj := n," 1 Ejii( x ij 
X X — Xj)'. At the multinormal (g\ = 4>i), E k (4>i) = k(k + 2)/a| and 
D k ((pi) = k/a k , so that K k ((j>i) = 0. 

The pseudo-Gaussian test <p^\ rejects Hq (at asymptotic level a) when- 
ever 

(5-9) Q { J}\-=- ( n i + n i')Q { N*;i,V >X( m -l)(fe +l);l- Q , 

l<i<i'<m 
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where, letting S := n _1 YaLi n A, 



n (n) ,_ njUji 1_ 



ni + m' 2(1 + k k ) 
x {, r[ (S-(S, - S„)f] - (fc + ^ + 2 ^[S-(S, - B, )]}. 
This test statistic is clearly afRne-invariant; the following Theorem (see [19]) 

(n) 

summarizes its asymptotic properties, which also are those of ^schott*- 
Theorem 5.2. Assume that (A) and (B) hold. Then: 

(i) Qjtfl is asymptotically chi-square with (m — l)(/co + 1) degrees of 
freedom under Ui?e.M(Y) U^e^O 3 ^^}) an ^ [provided that (B) is reinforced 
into (B')/ asymptotically noncentral chi-square, still with (m — l)(fco + 1) 
degrees of freedom but with noncentrality parameter 

(KirA & _// , 1 r /// 

1 j (fc + 2)K fe ( 5 i) + 2 ^ + 2(1 + kM) »> t 

underVf +n _ 1/2u{n)T{n) . gi , with G M(T), vr := lim^ i>M»> t M(T), 
gi G T^{;.= T\C\T a ), r T J T and r^ T defined in (5.2) and (5.3); 

(ii) 4>m\ has asymptotic level a under U^eX(T) U^e^i 1 *^}; 

(iii) (f)fy\ is locally and asymptotically most stringent, still at asymp- 
totic level a, for UueX(T) U^e^O 3 *^} a 9 a i n st alternatives of the form 

6. Asymptotic relative efficiencies. The asymptotic relative efficiencies 
of the rank-based tests with respect to (fifyl and </>gchott* directly follow 

as ratios of noncentrality parameters under local alternatives (see Theo- 
rems 5.1 and 5.2). 

Proposition 6.1. Assume that (A), (B'), (C) and (D) hold, and that 
gi G T^. Then, the asymptotic relative efficiency of (p^ with respect to 

4>m\, when testing PJJ^ against P^-i/a^M*)^ ft € M(T) and ur := 
hm^oovWrW i M(T)J, is 



(1 - OAREgf +CARE^ e HtP/^ll 
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where 

In -I \ A nn scale / , (n) / ,171) \ ((k + 2)n k (gi) + 2)C\(K,g 1 ) 

(6-1) ARE fc5i ^ K /4>^):= AkC^jK) ' 

ApF (shape), , (n) /,(»),_ (1 + «fc(gl)) Jk( K , gO 

(6-2) ARE fcj9i (<P K /^).- k{k + 2)Jk{K) 

and £ := £&, T ,k, gi G [0, 1] is given fry 

£tf,x,fc, si := ((* + 2)«fc0ft) + 2)r^ J T 

x [2fc(l + K fc ((/i))rg T + ((fc + 2)« fc 0a) + 2)rg.] _1 . 

The "shape AREs" in (6.2) coincide with those obtained in problems 
involving shape only — such as testing null hypotheses of the form V = Vo 
for fixed Vo (see [17]). Proposition 6.1 shows that the AREs, with respect 
to the pseudo-Gaussian tests of Section 5.2, of the rank tests proposed in 
Section 5.1 are convex linear combinations of these "shape AREs" and the 
"scale AREs" in (6.1). 

Numerical values of (6.1) and (6.2), for various values of the space dimen- 
sion k and various radial densities (Student, Gaussian and power-exponential), 
are given in Table 1 for the van der Waerden test i?^W' ^ ne Wilcoxon test 

4> ^ , and the Spearman test (j> ^ (the score functions K a , a > were de- 
fined in Section 2.3). These ARE values are uniformly large (with the ex- 
ception, possibly, of univariate scale Wilcoxon AREs), particularly so under 
heavy tails, as often in rank-based inference. Also note that the AREs of 
the proposed van der Waerden tests with respect to the parametric Gaus- 
sian tests are larger than or equal to one for all distributions considered in 
Table 1. For pure shape alternatives, [29] has shown that a Chernoff-Savage 

property holds, that is, inf 91 ARE^^ pc ^ (^vdlv/^A^) = ^" ® ne ma y won der 

whether this uniform dominance property of van der Waerden tests extends 
to the present situation. Although it does for usual distributions, including 
all Student and power-exponential ones, the general answer unfortunately is 
negative; see Section 4 of [29] for a (pathological) counter example. 

(n) (n) 

Note that Theorem 5.2 clearly shows that <p\fi and ^g^ott* are no ^ efficiency- 
robust. Indeed, the noncentrality parameter (5.10) under Student radial 
densities with 4 + 5 degrees of freedom tends to zero as 5 — > 0, so that 
asymptotic local powers are arbitrarily close to the nominal level a. The 
efficiency-robustness of our rank tests, quite on the contrary, is not affected, 
as the ARE values (6.1) and (6.2), under the same Student densities with 
4 + 5 degrees of freedom, both tend to infinity as S — > 0. 
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Table 1 

AREs, for £ = (pure scale alternatives) and £ = 1 (pure shape alternatives), of the van 
der Waerden (vdW), Wilcoxon (W), and Spearman (SP) rank-based tests with respect to 
the pseudo- Gaussian tests, under k -dimensional Student (with 5, 8 and 12 degrees of 
freedom), Gaussian, and power-exponential densities (with parameter n — 2,3,5), for 

k = 2, 3, 4, 6, 10 and k — > oo 



Underlying density 



k 


e 


u 


t 8 


tl2 


Af 


e 2 


e 3 


e 5 


vdW 1 





2.321 


1.230 


1.082 


1.000 


1.151 


1.376 


1.822 


2 


1 




2.551 


1.280 


1.102 


1.000 


1.115 


1.296 


1.669 




1 


2.204 


1.215 


1.078 


1.000 


1.129 


1.308 


1.637 


3 





2.732 


1.322 


1.120 


1.000 


1.092 


1.241 


1.558 




1 


2.270 


1.233 


1.086 


1.000 


1.108 


1.259 


1.536 


4 


o 


2.881 


1.358 


1.136 


1.000 


1.077 


1.202 


1.475 




1 


2.326 


1.249 


1.093 


1.000 


1.093 


1.223 


1.462 





n 
u 


O.IUO 


1 A 1 f\ 


i i^q 

l.-LOO 


l.UUU 


l.UO I 


i i 

J LOl 


l.ODl 




1 


2.413 


1.275 


1.106 


1.000 


1.072 


1.174 


1.363 


10 





3.403 


1.498 


1.204 


1.000 


1.037 


1.099 


1.239 




1 


2.531 


1.312 


1.126 


1.000 


1.050 


1.121 


1.254 


oo 





4.586 


1.894 


1.446 


1.000 


1.000 


1.000 


1.000 




1 


3.000 


1.500 


1.250 


1.000 


1.000 


1.000 


1.000 


W 1 





1.993 


0.939 


0.769 


0.608 


0.519 


0.509 


0.517 


2 


1 




2.604 


1.185 


0.959 


0.750 


0.694 


0.703 


0.743 




1 


2.258 


1.174 


1.001 


0.844 


0.789 


0.804 


0.842 


3 





2.929 


1.304 


1.045 


0.811 


0.775 


0.795 


0.854 




1 


2.386 


1.246 


1.068 


0.913 


0.897 


0.933 


1.001 


4 





3.140 


1.377 


1.096 


0.844 


0.820 


0.844 


0.911 




1 


2.432 


1.273 


1.094 


0.945 


0.955 


1.006 


1.095 


6 





3.407 


1.467 


1.156 


0.879 


0.866 


0.892 


0.961 




1 


2.451 


1.283 


1.105 


0.969 


1.008 


1.075 


1.188 


10 





3.685 


1.560 


1.216 


0.908 


0.903 


0.925 


0.984 




1 


2.426 


1.264 


1.088 


0.970 


1.032 


1.106 


1.233 


oo 





4.323 


1.794 


1.374 


0.955 


0.955 


0.955 


0.955 




1 


2.250 


1.125 


0.938 


0.750 


0.750 


0.750 


0.750 


SP 1 





2.333 


1.126 


0.935 


0.760 


0.705 


0.724 


0.774 


2 


1 




2.737 


1.289 


1.063 


0.868 


0.868 


0.924 


1.038 




1 


2.301 


1.230 


1.067 


0.934 


0.965 


1.042 


1.168 


3 





2.913 


1.348 


1.105 


0.904 


0.924 


0.993 


1.136 




1 


2.277 


1.225 


1.070 


0.957 


1.033 


1.141 


1.317 


4 





3.016 


1.378 


1.125 


0.920 


0.949 


1.020 


1.170 




1 


2.225 


1.200 


1.051 


0.956 


1.057 


1.179 


1.383 


6 





3.137 


1.410 


1.142 


0.932 


0.966 


1.032 


1.176 




1 


2.128 


1.146 


1.007 


0.936 


1.057 


1.189 


1.414 


10 





3.255 


1.438 


1.154 


0.937 


0.969 


1.022 


1.139 




1 


2.001 


1.068 


0.936 


0.891 


1.017 


1.144 


1.365 


oo 





3.507 


1.503 


1.176 


0.895 


0.895 


0.895 


0.895 




1 


1.667 


0.833 


0.694 


0.556 


0.556 


0.556 


0.556 
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7. Simulations. We conducted two simulations, one for pure scale al- 
ternatives and another one for pure shape alternatives, both in dimension 
k = 2. More precisely, starting from two sets of i.i.d. bivariate random vec- 
tors £\j (J = 1, . . . , n\ = 100) and e 2 j (j = 1, . . . , rc 2 = 100) with spherical 
densities (the standard bivariate normal and bivariate t-distributions with 
0.5, 2 and 5 degrees of freedom) centered at 0, we considered independent 
samples obtained from 

Xij = Aieij + j = l,...,n 1 , 

and 

X 2 j = A 2 (£)e 2 j + #2, j = l,...,n 2 , 

where A 2 (^)A' 2 (^) = (1 + £s 2 )(Ai A[ + £v) [v a symmetric (k x k) matrix 
with tr((AiA' 1 )~ 1 v) = 0], £ = 0, 1, 2, 3. The values of £ produce the null (£ = 
0) and increasingly heterogeneous alternatives (£ = 1,2,3); all tests being 
affine-invariant, there is no loss of generality in letting Ai = I 2 and 9\ = 
2 = O. 

In the first simulation (pure scale alternatives), we generated N = 2,500 
independent samples, with v = and s 2 = 0.30, 0.44, 0.56 and 1.50 under 
Gaussian, £5, i 2 and to.5 alternatives, respectively; these values of s 2 have 
been chosen in order to obtain rejection frequencies of the same order un- 
der those four densities. In the second simulation (pure shape alternatives), 
we similarly generated ./V = 2,500 independent samples, with s 2 = and 

vechv = (0,0.18)', (0,0.20)', (0,0.21)' and (0,0.22)' under Gaussian, t 5 , t 2 
and £o.5 alternatives, respectively, still with the same objective of obtaining 
comparable empirical powers under the various densities considered. 

In each of these samples, we performed the following eight tests (all at 
asymptotic level a = 5%): (a) the modified likelihood ratio test <^mlrt> 
(b) the parametric Gaussian test 4>m (equivalently, Schott's original test 
^Schott)' ( c ) ^ s pseudo-Gaussian version <j)j}\, based on (5.9) (equivalently, 
the modified Schott test ^schott*)' (°0 the van der Waerden test </>^w [based 

on (5.6)]; (e)-(g) t^-score tests ^ v with v = 5, 2 and 0.5 [based on (5.7)], 

as well as (h) the Spearman test (p^ [based on (5.8)]. It can be checked 

(n) (n) 

that the Wilcoxon test (ft for k = 2, coincides with <ft ^ . 

Rejection frequencies are reported in Table 2 for pure scale alternatives, 
and in Table 3 for pure shape alternatives [the corresponding individual con- 
fidence intervals (for N = 2,500 replications), at confidence level 0.95, have 
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half-widths 0.0044, 0.0080 and 0.0100, for frequencies of the order of 0.05 
(0.95), 0.20 (0.80) and 0.50, resp.]. These frequencies indicate that the rank 
tests, when based on their asymptotic chi-square critical values, are conser- 
vative and significantly biased at moderate sample size (100 observations in 
each group). In order to remedy this, we also implemented versions of each 
of the rank procedures based on estimations of the (distribution-free) quan- 
tile of the test statistic under known location and known common null value 
of the shape. These estimations, just as the asymptotic chi-square quantile, 
are consistent approximations of the corresponding exact quantiles under 
the null, and were obtained for each of the five rank tests under consider- 
ation in (d)-(h) above, as the empirical 0.05-upper quantiles go. 95 of each 
rank-based test statistic in a collection of 10 5 simulated multinormal sam- 
ples, yielding go.95 = 7.2117, 7.6351, 7.7473, 7.7636 and 7.6773, respectively. 
These bias-corrected critical values all are smaller than the asymptotic chi- 
square one xi-o.95 = 7.8147, so that the resulting tests are uniformly less 
conservative than the original ones. The resulting rejection frequencies are 
given in parentheses. 

Inspection of Tables 2 and 3 confirms the fact that the parametric Gaus- 
sian tests 4>j\f, contrary to the pseudo-Gaussian ones </>jv"*> are invalid under 
non-Gaussian densities (culminating, under io.5> with a size of 0.9992). How- 
ever, even the pseudo-Gaussian tests 4>j\f*, though resisting non-Gaussian 
densities with finite fourth-order moments, are collapsing under the heavy- 
tailed to. 5 and ti distributions (with power less than 10 -4 under to. 5)- I n 
sharp contrast with this, all rank-based tests appear to satisfy the 5% 
probability level constraint. They are conservative in their original versions 
(particularly so for van der Waerden scores), but reasonably unbiased (for 
n i = n 2 = 100) after bias-correction. Empirical power rankings are essen- 
tially consistent with ARE values; in order to allow for meaningful com- 
parisons under infinite fourth-order moments, we also provide AREs with 
respect to the van der Waerden rank test. 

APPENDIX 

A.l. Proofs of Lemma 4.1, Theorem 5.1 and Corollary 5.1. 

PROOF of Lemma 4.1. (i) Fix re{l,...,m}. Clearly, under P^, 
A^ = A^ ;ffi+OL2 (l)iff 

m ni / E? \ m ni , /AW 

where S^. r := n i 1 ^ 2 5i ir . For -d £ .M(Y), the R^s are the ranks of the dij/cr's, 

(n) 

which under ~P^. gi are i.i.d. with distribution function G\k- The asymptotic 



Table 2 

Rejection frequencies (out of N — 2,500 replications), under the null and various scale alternatives (see Section 7 for details), of the 
Gaussian modified LRT (</>mlrt), the parametric Gaussian test {4>rt), its pseudo-Gaussian version (4>at*), and the signed-rank van der 
Waerden (0 v dw), t v -score (4> ft , v = 0.5, 2, 5), Wilcoxon-type (<fi k^) and Spearman-type (4>k 2 ) tests, respectively. Sample sizes are 

Tli =ri2 = 100. ARE values are provided with respect to the parametric pseudo-Gaussian (AKEfS*) and van der Waerden rank tests 
(ARE v dw); U ND" means "not defined" (this occurs as soon as one the tests involved is not valid under the distribution under 

consideration) 

1 



Test 







1 


2 


3 


AREaA* 


AREvdw 


4>LKT 


M 


0.0512 


0.3168 


0.7932 


0.9776 


1.000 


1.000 


4>MLRT 




0.0500 


0.3100 


0.7876 


0.9772 


1.000 


1.000 


</>Af 




0.0464 


0.3008 


0.7760 


0.9756 


1.000 


1.000 






0.0472 


0.2944 


0.7568 


0.9736 


1.000 


1.000 


0vdW 




0.0348 (0.0472) 


0.2388 (0.2932) 


0.6912 (0.7316) 


0.9520 (0.9676) 


1.000 


1.000 


<f) f t 




0.0444 (0.0496) 


0.2604 (0.2724) 


0.7080 (0.7200) 


0.9552 (0.9600) 


0.918 


0.918 


t*U = t Kl 




0.0516 (0.0524) 


0.2180 (0.2248) 


0.6360 (0.6404) 


0.9004 (0.9016) 


0.750 


0.750 


<f> ft 




0.0476 (0.0492) 


0.1224 (0.1248) 


0.3252 (0.3260) 


0.5692 (0.5724) 


0.360 


0.360 


<f>K 2 




0.0432 (0.0480) 


0.2448 (0.2572) 


0.6956 (0.7060) 


0.9480 (0.9508) 


0.868 


0.868 


<t>LKT 




0.3288 


0.6308 


0.9168 


0.9872 


ND 


ND 


0MLRT 




0.3244 


0.6260 


0.9144 


0.9868 


ND 


ND 






0.3160 


0.6208 


0.9092 


0.9856 


ND 


ND 






0.0300 


0.1896 


0.5268 


0.7892 


1.000 


0.392 


<PvdW 




0.0320 (0.0444) 


0.2500 (0.2956) 


0.7068 (0.7468) 


0.9396 (0.9560) 


2.551 


1.000 


<f) ft 




0.0428 (0.0480) 


0.3004 (0.3152) 


0.7740 (0.7812) 


0.9636 (0.9676) 


2.778 


1.089 


4> f t =(j> Kl 




0.0488 (0.0512) 


0.2916 (0.2980) 


0.7456 (0.7520) 


0.9528 (0.9544) 


2.604 


1.021 



Table 2 
Continued 



Test 







1 


2 


3 


AREaT, 


AREvdw 






0.0512 (0.0516) 


0.1824 (0.1848) 


0.4916 (0.4972) 


0.7556 (0.7612) 


1.543 


0.605 


4>K 2 




0.0448 (0.0484) 


0.3068 (0.3184) 


0.7720 (0.7828) 


0.9644 (0.9656) 


2.737 


1.073 


<^LRT 




0.8728 


0.9164 


0.9496 


0.9712 


ND 


ND 


0MLRT 




0.8696 


0.9156 


0.9496 


0.9700 


ND 


ND 






0.8648 


0.9120 


0.9480 


0.9684 


ND 


ND 


4>n* 




0.0120 


0.0300 


0.0672 


0.1276 


ND 


ND 


<j>vdW 




0.0428 (0.0568) 


0.1880 (0.2264) 


0.5368 (0.5816) 


0.7988 (0.8324) 


ND 


1.000 


<f> ft 




0.0536 (0.0592) 


0.2532 (0.2644) 


0.6592 (0.6704) 


0.9000 (0.9072) 


ND 


1.250 






0.0508 (0.0532) 


0.2732 (0.2816) 


0.6912 (0.6964) 


0.9212 (0.9236) 


ND 


1.333 


L J l,0.5 




0.0496 (0.0500) 


0.2116 (0.2136) 


0.5404 (0.5468) 


0.8128 (0.8144) 


ND 


1.000 


<£if 2 




0.0572 (0.0588) 


0.2568 (0.2652) 


0.6632 (0.6708) 


0.9036 (0.9080) 


ND 


1.250 


<^>LRT 


*0.5 


0.9992 


0.9996 


0.9996 


0.9988 


ND 


ND 


0MLRT 




0.9992 


0.9996 


0.9996 


0.9988 


ND 


ND 


<j>AT 




0.9992 


0.9996 


0.9988 


0.9988 


ND 


ND 


















ND 


ND 


<^vdW 




0.0388 (0.0520) 


0.1464 (0.1764) 


0.3096 (0.3572) 


0.4608 (0.5188) 


ND 


1.000 


f t 
~ 1.5 




0.0496 (0.0524) 


0.2328 (0.2452) 


0.5000 (0.5132) 


0.6920 (0.7044) 


ND 


1.543 






0.0508 (0.0528) 


0.3076 (0.3136) 


0.6404 (0.6448) 


0.8276 (0.8316) 


ND 


2.083 


ft 




0.0604 (0.0616) 


0.3928 (0.3972) 


0.7572 (0.7600) 


0.9208 (0.9212) 


ND 


2.778 


<^K 2 




0.0488 (0.0524) 


0.2136 (0.2228) 


0.4728 (0.4840) 


0.6672 (0.6792) 


ND 


1.435 



Table 3 

Rejection frequencies (out of N = 2,500 replications), under the null and various shape alternatives (see Section 7 for details), of the 
Gaussian modified LRT (</>mlrt), the parametric Gaussian test {4>rt), its pseudo-Gaussian version {4>M*), and the signed-rank van der 
Waerden (0 v dw), t v -score (4> ft , v = 0.5. 2, 5), Wilcoxon-type (<fi k^) and Spearman-type (4>k 2 ) tests, respectively. Sample sizes are 

Tli =ri2 = 100. ARE values are provided with respect to the parametric pseudo-Gaussian (AKEfS*) and van der Waerden rank tests 
(ARE v dw); U ND" means "not defined" (this occurs as soon as one the tests involved is not valid under the distribution under 

consideration) 

1 



Test 







1 


2 


3 


AREaA* 


AREvdw 


4>LKT 


M 


0.0512 


0.1564 


0.6032 


0.9668 


1.000 


1.000 


4>MLRT 




0.0500 


0.1532 


0.5984 


0.9656 


1.000 


1.000 


</>Af 




0.0464 


0.1484 


0.5900 


0.9640 


1.000 


1.000 






0.0472 


0.1444 


0.5812 


0.9648 


1.000 


1.000 


0vdW 




0.0348 (0.0472) 


0.1212 (0.1464) 


0.5248 (0.5828) 


0.9488 (0.9632) 


1.000 


1.000 


<f) f t 




0.0444 (0.0496) 


0.1452 (0.1536) 


0.5456 (0.5596) 


0.9464 (0.9496) 


0.945 


0.945 


t*U = t Kl 




0.0516 (0.0524) 


0.1364 (0.1392) 


0.4928 (0.5004) 


0.9272 (0.9276) 


0.844 


0.844 


<f> ft 




0.0476 (0.0492) 


0.1120 (0.1140) 


0.3996 (0.4036) 


0.8356 (0.8388) 


0.648 


0.648 


4>K 2 




0.0432 (0.0480) 


0.1440 (0.1508) 


0.5420 (0.5512) 


0.9460 (0.9488) 


0.934 


0.934 


<t>LKT 




0.3288 


0.4632 


0.7840 


0.9808 


ND 


ND 


0MLRT 




0.3244 


0.4600 


0.7816 


0.9800 


ND 


ND 






0.3160 


0.4512 


0.7728 


0.9796 


ND 


ND 






0.0300 


0.1020 


0.4204 


0.8552 


1.000 


0.454 


<PvdW 




0.0320 (0.0444) 


0.1268 (0.1592) 


0.5320 (0.5816) 


0.9576 (0.9692) 


2.204 


1.000 


<f) ft 




0.0428 (0.0480) 


0.1572 (0.1676) 


0.5928 (0.6036) 


0.9720 (0.9740) 


2.333 


1.059 


4> f t =(j> Kl 




0.0488 (0.0512) 


0.1608 (0.1632) 


0.5876 (0.5916) 


0.9684 (0.9692) 


2.258 


1.024 



Table 3 
Continued 



Test 









1 


2 


3 


AREaT, 


AREvdw 






0.0512 (0.0516) 


0.1376 


(0.1388) 


0.5088 (0.5132) 


0.9312 (0.9332) 


1.896 


0.860 


4>K 2 




0.0448 (0.0484) 


0.1612 


(0.1704) 


0.5860 (0.5976) 


0.9700 (0.9716) 


2.301 


1.044 


<^LRT 




0.8728 


0.8912 




0.9376 


0.9768 


ND 


ND 


0MLRT 




0.8696 


0.8892 




0.9364 


0.9768 


ND 


ND 






0.8648 


0.8864 




0.9332 


0.9764 


ND 


ND 


4>n* 




0.0120 


0.0224 




0.0808 


0.2380 


ND 


ND 


<j>vdW 




0.0428 (0.0568) 


0.1180 


(0.1488) 


0.4596 (0.5120) 


0.9216 (0.9416) 


ND 


1.000 


<f> ft 




0.0536 (0.0592) 


0.1488 


(0.1560) 


0.5460 (0.5572) 


0.9576 (0.9616) 


ND 


1.147 


t f i,2 = t Ki 




0.0508 (0.0532) 


0.1584 


(0.1612) 


0.5640 (0.5668) 


0.9668 (0.9668) 


ND 


1.185 


L J l,0.5 




0.0496 (0.0500) 


0.1508 


(0.1524) 


0.5212 (0.5256) 


0.9412 (0.9420) 


ND 


1.089 


<£if 2 




0.0572 (0.0588) 


0.1440 


(0.1500) 


0.5288 (0.5420) 


0.9516 (0.9564) 


ND 


1.111 


<^>LRT 


^0.5 


0.9992 


0.9988 




0.9992 


0.9992 


ND 


ND 


0MLRT 




0.9992 


0.9988 




0.9992 


0.9992 


ND 


ND 


<j>AT 




0.9992 


0.9988 




0.9992 


0.9992 


ND 


ND 














0.0004 


0.0008 


ND 


ND 


<^vdW 




0.0388 (0.0520) 


0.0964 


(0.1208) 


0.3328 (0.3792) 


0.7960 (0.8328) 


ND 


1.000 


f t 
L '1,5 




0.0496 (0.0524) 


0.1280 


(0.1356) 


0.4288 (0.4408) 


0.8928 (0.9004) 


ND 


1.254 






0.0508 (0.0528) 


0.1396 


(0.1440) 


0.4840 (0.4880) 


0.9360 (0.9380) 


ND 


1.418 


4> t* 

L J l,0.5 




0.0604 (0.0616) 


0.1644 


(0.1648) 


0.5356 (0.5388) 


0.9560 (0.9568) 


ND 


1.543 


<^K 2 




0.0488 (0.0524) 


0.1208 


(0.1272) 


0.3968 (0.4064) 


0.8624 (0.8704) 


ND 


1.138 
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equivalence (A.l) thus follows from Hajek's classical projection result for 
linear rank statistics (see, e.g., [32], Chapter 2), since (a) the cjj.^'s are not 
all equal and (b) 



I (n) _i -r-^ (n) \o 



n max 



1 - A 



(n) 



X (») 



1 - A< n) 



ofll 



as n — ► oo 



(the Noether condition) holds; see the comments after Assumption (B). 
Similarly, for the shape part, A^.^ = A^.^ + o £ a(l) under pjj^ iff 



n 



-^m^V)^® 2 )- 1 / 2 ^ 
i=i 

= o L2 (l) 



A" 



n+1 



K(G lk 



22. 

a 



J^vec(U rj U; 



[where := I k 2 - \3 k satisfies M^V)^ 2 )" 1 ^^ = M k (V)(V m y^ 2 
and is such that J^- vec(U r jU^) is exactly centered], or equivalently iff 



(A.2) 



m rii 
(«) ._ V%( n ) 

i=lj=l 



A 



R 



i.i 



n+l 



[J^vec(U 4J U: 



i j j 



for all t e {1, 2, . . . , k 2 }, still under pjj^ . Now, 



A" 



A, 



n + l 



K[G lk 



2-. 



where, denoting by Uij >s the sth component of Uy, Ce, k = V&t[U 2 11 ] = 
2(k - l)/{k 2 {k + 2)) for £ e £ k := {mk + m + 1, m = 0, 1, . . . , k - 1}' and 
Cf,fc = Var[LAi, 1^11,2] = 1/A; 2 for £ ^ Here, the Hajek projection result 
for linear signed-r&nk statistics (see, e.g., [32], Chapter 3) yields (A.2), since 

/ ( n ) \2 / r ( n ) \2 — 1 /i \ 

max ij(%>) /22i,j{ c ir,r) = n r = oi 1 ), as n ^ oo. 

As for (ii), the result straightforwardly follows, under P^) gi with -d € 
A4(T), from the multivariate CLT. The result under local alternatives is 

(n) 

obtained as usual, by establishing the joint normality under P^.^ of A^-K- gi 

and ^] rn -i/2 v {n.) T i^. g i then applying Le Cam's third lemma; the required 
joint normality follows from a routine application of the classical Cramer- 
Wold device. □ 
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Proof of Theorem 5.1 . (i) The continuity of the mapping # \-> (P#. K , 
^Wk)i Proposition 4.2 (jointly with Assumption (Dl) and the fact that 
[I m - C( n )](A( n ))~ 1 l m = 0), and Lemma 4.1(i), entail 

QP = (A&rJ'P&r + (A^)'P^A^ + o P (l) 

(A.3) 

= (A.&,K; gi ) P i?;X^i9;A- ;!? i + {&-d;K m )' P i9;A'^i9;A'; 9 i +Op(l) 

under P^L, # € M(T) (and therefore, also under the contiguous sequence 

P S+«-i/M»)TW ifl i)- Now ' since ( r 4 J ;K) 1/2p fc( r ^V) 1/2 is a symmetric idem- 
potent matrix with rank m — 1, it follows from Lemma 4.1(h) that the first 
term in (A.3) is asymptotically chi-square with m — 1 degrees of freedom 

under P^) gi , "& € M(T), and asymptotically noncentral chi-square, still with 
m — 1 degrees of freedom, but with noncentrality parameter 



under P^T n -i/2 I/ (n) T (»). • Evaluation of the limit in (A.4) yields the first 
term in (5.4). 

As for the shape part, using again Lemma 4.1(h) and the fact that 
(r^-) 1 / 2 x Ptf^r,^-) 1 / 2 is symmetric and idempotent with rank ko(m — 
1), we obtain similarly that the second term in (A.3) is asymptotically chi- 
square with ko(m — 1) degrees of freedom under P#^ l5 $ 6 M(T), and 
asymptotically noncentral chi-square, still with ko(m — 1) degrees of free- 
dom but with noncentrality parameter 

(A.5) (^(i^fjimJCr^ 

under p ^ n _i/2 I/ (n) r (n). fll - Evaluation of the limit in (A.5) yields the second 
term in (5.4). As the two terms in (A.3) are asymptotically uncorrelated [see 
Lemma 4.1(h) again], they can indeed be treated separately. 

(ii) The fact that 0^ has asymptotic level a directly follows from the 

asymptotic null distribution in part (i) and the classical Helly-Bray theorem, 
(hi) Optimality is a consequence of the asymptotic equivalence (A.3), 

under g\ = /i(e F a ), of and the locally asymptotically optimal test 
statistic Qf, as described in (4.3). □ 

Proof of Corollary 5.1. (i) Fix g x e T ai with £ k (K,gi) / / 
J k (K,gi). Clearly, cpP is consistent under p ^ n -i/2 I/ (n) T (n). Jl7 $ G M(T) 
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iff the corresponding noncentrality parameter in (5.4) is nonzero. Assume 
the latter is zero. Then, the assumptions on g± imply that s 2 /\/X~i = s 2 /\fXii 
and 

for all Now, since tr(A 2 ) = implies that A = for any symmetric 

k X k matrix A, it follows from (A.6) that Vi/y/Xi = Vi'/y/Xi/ for all 
This is possible only for vt G .M(Y), which establishes the result, 
(ii) Going back to the definition of g\ \— > J k (K, g±), we have 

J k (K,g 1 ) = / K(G lk (r))r(p gi (r)gik(r)dr 
Jo 

1 POO 

= / K(G lk (r))(- gi {r))r k dr. 

Mfe-liSi Jo 

Integrating by parts, J k (K, 9l ) = f™[kK(G lk (r)) + K' (G lk (r))rg lk (r)] x 
g lk (r)dr = k 2 + J^K'(G lk (r))r(g lk (r)) 2 dr, so that fjj° K'(G lk (r)) x 
r{g\ k {r)) 2 dr > guarantees that £ k (K,gi) = J k (K,g\) — k > 0. The result 
follows from part (i) of the corollary. □ 

A. 2. Proof of Proposition 4.2. Consider an arbitrary value # = (fij, -d'jj, 

■d' in y = (0' 1; . . . , 0' m ,a 2 l' m , l' m (8> (vech V)')' G M(T) of the parameter and a 
(bounded) sequence of corresponding local perturbations $( n ' := # + 
w - 1 / 2 i/( n ) x T^ n \ where 

r (n) = (T (n)/ )T H/ )T (n)y 

= (<>>', . . . , t£>, . . . , (vech vf))', . . . , (vech v&>)')' 

is such that G .M (Y) for all n. To prove Proposition 4.2, it is sufficient 
to show that, under P& gi (where g\ is as in Proposition 4.2), 



(A.7) 



a, II a. II , £k{K,gi) ( n ) 

~ 4^ + — i^i — T // and 
Aj^. x - Agf + ^(tf, <7i)[I m ® H fe (V)]rg 



are op(l) as n — ► oo, since the local asymptotic discreteness of # (see, 
e.g., [25], Lemma 4.4) allows to replace the nonrandom quantity with 
the random one 1? in (A.7). Note that $ being constrained allows us to re- 
strict to #( n ) G M(T). Looking at block i (i G {1, . . . ,m}), Proposition 4.2 
thus is a corollary of the following result. 
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Proposition A.l. Assume that (A), (B) and (C) hold, and that gi G 
T a - Fix i9 G M(T) and a sequence G M(T) as above. Then, for all 
i = 1, . . . ,m, 

(A.8) a^-A^+^^-JW and 

■d;K 

A%l K - A. IH,i + J k (K, gi )H k (V)(vechvM) 

■&;K 

are op(l) under P^^, as n— ► oo. 

Proof. In this proof, we let 0£ := t + n~ 1/2 tj n) , V n := V + n~ 1/2 vf ] , 
and er£ := a 2 + n^ 1 ^ 2 sf^ [since "Q,-d^ G A4(Y), er 2 and V n do not depend 
on i, which explains the notation]. Accordingly, let Z?- := V _1 / 2 (Xjj — 

4 := ||Z9.||, UP- := %jd%, Z£ := (V»)-V2(Xy - 0?), := ||Z|||, and 
U™- := Zfj/dfj. Following an argument that goes back to [24], consider the 
following truncation of the score function K: for all i G No, define 

K {e) (u) :=ifQW«- jjl[i/£ <u <2/q +K(u)I [2 / e<u < 1 -2/e\ 



! [l-2/^<u<l-l/f]i 

where J4 denotes the indicator of A. Since u 1— ► i^(it) is continuous, the func- 
tions u 1 ^ KW(u) are also continuous on (0, 1). It follows that the truncated 
scores are bounded for all I. Clearly, it can safely be assumed that K is 
monotone increasing (rather than the difference of two monotone increasing 
functions), so that there exists some L such that < |-K"(it)| for all 

u G (0, 1) and all I > L. 

We start with the proof that the scale part of (A.8) is op(l) under Pj^- 
For the shape part, the result is an easy extension of the corresponding 
result in [14]; details are left to the reader. Lemma 4.1(i) shows that A^'J^ — 



K-Kigt is opC 1 )' under P Sx- Similarly, &%, K - ^%. K , gi is °p(!) under 
P^ji) — hence, from contiguity, also under Pj^- Consequently, (A.8) is 
asymptotically equivalent, under P^ gi , to 

Now, \nJ l,2 Y^%i{K{Gik{d r } j /o- n )) - k), under P^l) , is asymptotically 
normal as n — > oo, with mean zero and variance ^£fc(-RT), so that 3(^3- — 
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^)^" 1/2 E"=i(^(G'i fc (dr j K)) - k) is o P (l), as n-oo, under P% ;gi , as 
well as under P^) gi (from contiguity). Consequently, (A. 9) is asymptotically 
equivalent, under P^L, to 



-1/2 



2a 2 



3=1 



2d 2 



-1/2 



Y^(K(G lk (d%/a))-k) + 



3=1 



AC^j gl) 2(n) 

4a 4 Sl • 



Decompose c£ n) into c\ n) = u\f ] + Dg ; ' } - + Rg ; ' } + R&*> where, 

denoting by Eq expectation under PjJ/ 



91' 



D 



£1 



-^n- 1/2 ^(i^«(G lfe (4K))-E[ J ft:W (C/)]) 



2(J 2 



3=1 



D 



R 



'il 



2d 2 



n 



-1/2 



2(^ W (Gi*(4/(r))-E[^W(l7)]) 



3=1 



1 -V 2 T? 

n- E 



%2 



2a 2 



1 -V 2 T? 

2V n * E ° 



^(ifW(G lfc (4K))-E[^)(C/)]) 
Lj=l 



II, 



^(ifW(G lfe (4K))-E[^)(C/)]) 

■3=1 



A^^l) 2(n) 



4(T 4 



2^ 



1/2 



3=1 



(xW(G lfc (4/ f x))-E[A'W([/)])], 



R 



j2 



and 



2a 2 



3=1 



(^)(G lfc (£.K))-E[^)([/)])], 



R 



i3 ' 4a 4 

(n) 



■7— j(^fe(A,3i) — Ck{K^\gi))i 



2(n) 



We prove that =op(l) [thus completing the proof that (A. 8) is op(l) 
under P^J by establishing that and are op(l) under Pjj^, 
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as oo, for fixed £, and that R^ \ R^ ana - are °p(1) un der 

the same sequence of hypotheses, as t — > oo, uniformly in n. For the sake 
of convenience, these three results are treated separately (Lemmas A.l, A. 2 
and A.3). 

Lemma A.l. For any fixed i, Eo[|D^'^| 2 ] = o(l) as ?t,^oo. 
Lemma A. 2. For any fixed t, = o(l) as n — > oo . 

Lemma A.3. As I — > oo, uniformly in n , (i) Rg™ isop(l) under ^fi) gi , 

(ii) R^2 «s op(l) under pg /or n sufficiently large and (iii) R^j «s 
o(l). 

Proof of Lemma A.l. First note that Dj™ ;£) = ^n^ 2 E^ijT^ - 
E [T^% where } := ^)(G lfc (^/o- n ))-if i = 1, . . . ,n, 

are i.i.d. Writing Varg for variances under P^.^, 

EoflDrf] = VarolD^] = ^ Var [T t ^] < ^EoIIT^f], 
and it only remains to show that, as n — > oo, 

E [|T^| 2 ] = E [(K^(G lk (d^/a n )) - (G lk (d° a /a))) 2 } 

(A.10) 

= (i) 

Now, \d&/a n - 4Ja\ < \d& - d° a \/a n + \a~ l - a' 1 ^ is o P (l) under pg x 
since \df± — d^J is; see Lemma A.l in [14]. This and the continuity of K^oG\ k 
imply that K^{G lk {d^/a n )) - K^{G lk {d^/a)) = o P (l) under pg x , as 

n— > oo. Since K^> is bounded, this convergence also holds in quadratic 
mean, which entails (A. 10). □ 

Proof of Lemma A. 2. Letting 

one can show that, under P^^, as n — > oo, 

(A.11) B^^A^O^Var^V)])- 
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(n) 

Under the sequence of local alternatives P^ ( „) , as n — > oo , 



t n .£) C k (K^\gi) 2 (n) c 
BJi — s.- v — i 



4cj 4 



Defining := ^nj 1 ' 2 £^i(A^)(G lfc (^/a n )) -E[ifW(C7)]), it follows 



(n) 

from ULAN that, under , as n — > oo, 



(A.12) + A( ^' gl) 4 (n) ^ AA(o, ^C*(* W )) • 

Now, from (A.ll) and the fact that, under pg i? = -bJ" ;<) - 

Eq[B^2' ] = op(l) (Lemma A.l), we obtain that, under P^^, as n —> oo, 

(A.13) B<2*> -E [Br } ] -^^(o,^jA(A-W)). 

The result then follows from comparing (A.12) and (A.13). □ 

We now complete the proof that (A. 8) is op(l) under P^ x by proving 
Lemma A. 3. 

PROOF of Lemma A. 3. (i) In view of the independence under P^^ of 
the d?.'s, 

EoN^i 2 ] = ^E E o[[(^(Gi fc (4/^)) - k ) 



4ct 4 . 

3=1 



(^)(G lfc «./ C 7))-E[^W(C/)])] 5 



(A.14) = _i_ V ar[K([/)-A^)([/)] 



^^EK^C/)-^)^)) 2 ] 



4a 4 7o 



[ir(u)-iirW(u)] 2 du 



for all n. Clearly, K^'(u) converges to K(u), for all u £ (0,1). Also, since 
is bounded by |lf («)|, for all £ > L, the integrand in (A.14) is 
bounded (uniformly in £) by 4| ^^^(li) | 2 , which is integrable on (0,1). The 

Lebesgue dominated convergence theorem thus implies that Eo[|Rj-"'^| 2 ] = 
o(l), as I— > oo, uniformly in n. 
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(ii) Claim (ii) is the same as (i), with c&/cr n replacing d^/a. Accord- 
ingly, (ii) holds under P^i) • That it also holds under P& gi follows from 



Lemma 3.5 in [24]. 

(iii) Note that \C k (K, gi ) - C k {K^, gi )\ 2 = \ Cov[K(U) - K^(U), 
K gi (U)}\ 2 < C k (gi) x Vtu[K(U) - K®(U)\, which is o(l) as oo [see (i) 
above]. The result then follows from the boundedness of (s? ). □ 
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